Abstract. For a measure $\mu$ supported on a compact connected subset of a
Euclidean space which satisfies a uniform $d$-dimensional decay of the volume
of balls of the type

$\alpha\delta^d \le \mu(B(x,\delta)) \le \beta\delta^d$ (1)

we show that the maximal edge in the minimum spanning tree of $n$ independent
samples from $\mu$ is, with high probability,

$\approx \left(\frac{\log n}{n}\right)^{1/d}$.

1. Introduction 

The laws governing the behavior of the minimum spanning tree (MST) on points
in the unit disk and ball have been thoroughly researched. See [S90] for a survey
on the worst case. For the average case (or points taken randomly), extremely fine
results are known. See [A96] for the central limit theorem for the total length,
[ABNW99, AB99] for the dimension of a typical path, [AS04, chapter 6] for an
"objective" approach and |iV for intriguing simulation results. See also the book
[MR96] for the strongly related continuum percolation.

The same questions on other metric spaces, in particular on fractals, have seen
much less research. In [KLS06] we studied the worst case problem for the total
weighted length. Here we switch to the average case and are interested in the
length of the longest edge, which, by the greedy algorithm, is the same as the
connectivity threshold, i.e. the minimal number $r$ such that the graph in which two
points are connected if and only if their metric distance is $< r$ is connected. In the
setting of a ball in $\mathbb{R}^d$ this is known to be, with high probability,

$\approx \left(\frac{\log n}{n}\right)^{1/d}$

where $\approx$ means that the ratio of the two quantities is bounded between two absolute
constants. We wish to extend this result to fractal sets.
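The identity between the longest MST edge and the connectivity threshold can be explored numerically. The following is a minimal sketch, not taken from the paper: it assumes uniform samples in the unit cube $[0,1]^d$ (rather than a ball), uses Prim's algorithm in its simple $O(n^2)$ form, and the function names `longest_mst_edge` and `normalized_threshold` are hypothetical.

```python
import math
import random


def longest_mst_edge(points):
    """Return the longest edge of the Euclidean MST of `points` (Prim, O(n^2)).

    This length is also the smallest r for which the graph joining all
    pairs at distance <= r is connected.
    """
    n = len(points)
    if n < 2:
        return 0.0
    in_tree = [False] * n
    best = [math.inf] * n  # best[v] = distance from v to the current tree
    best[0] = 0.0
    longest = 0.0
    for _ in range(n):
        # Pick the vertex outside the tree that is closest to it.
        u = min((v for v in range(n) if not in_tree[v]), key=lambda v: best[v])
        in_tree[u] = True
        longest = max(longest, best[u])
        for v in range(n):
            if not in_tree[v]:
                dist = math.dist(points[u], points[v])
                if dist < best[v]:
                    best[v] = dist
    return longest


def normalized_threshold(n, d=2, seed=0):
    """Longest MST edge of n uniform points in [0,1]^d over (log n / n)^(1/d)."""
    rng = random.Random(seed)  # seeded so that runs are reproducible
    pts = [tuple(rng.random() for _ in range(d)) for _ in range(n)]
    return longest_mst_edge(pts) / (math.log(n) / n) ** (1.0 / d)
```

Calling `normalized_threshold(n)` for increasing `n` should give values that stay within a bounded range, which is what the relation $\approx$ expresses; the sketch makes no claim about the value of the constants.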

Clearly, to get any kind of estimate one has to assume the fractal is connected.
Further, it is clear that some kind of regularity is needed. To see why,
it might be instructive to consider the following example: in $\mathbb{R}^2$ take the set
$F = \bigcup_{i=1}^{\infty}(A_i \cup B_i)$, consisting of a set of "thick" vertical slabs
$A_i = [\frac{1}{2i}, \frac{1}{2i-1}] \times [0,1]$
connected by "thin" horizontal bridges
$B_i = [\frac{1}{2i+1}, \frac{1}{2i}] \times [0, 2^{-i}]$; see Figure 1.
Take the normalized Lebesgue measure on this set. Now take $n$ random points
and connect them by their MST. Since the bridges become thin very fast, there
will be no points in the very thin bridges that connect the slabs starting from
$[\frac{1}{2\lfloor\log n\rfloor}, \frac{1}{2\lfloor\log n\rfloor - 1}] \times [0,1]$. Thus, the MST will contain edges that are
$> c/\log n$ long. This is despite the fact that this set has Hausdorff dimension 2
and in fact is a monofractal (a degenerate multifractal spectrum), which indicates
strong regularity in some sense.




Figure 1. The set F (not to scale). 

Definition 1.1. The metric probability space $(F, \rho, \mu)$ is semi uniform of
dimension $d$ if there exist numbers $\delta_0$ and $0 < \alpha < \beta < \infty$ such that for every
point $x \in F$ and for every $0 < \delta < \delta_0$,

$\alpha\delta^d \le \mu(B(x,\delta)) \le \beta\delta^d$

where $B(x,r)$ is the ball of radius $r$ centered at $x$ in the metric $\rho$. We call $\delta_0$, $\alpha$
and $\beta$ the parameters of the metric probability space.
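As a sanity check of the definition, here is a simple worked instance that is not taken from the paper: the unit square with Lebesgue measure, for which the parameters can be read off directly. The specific constants below are an illustration under that assumption.

```latex
% Illustration (assumption): F = [0,1]^2, Euclidean metric, Lebesgue measure \mu, d = 2.
% For x \in F and 0 < \delta < 1/2, at least a quarter of the disc B(x,\delta)
% lies inside F, and B(x,\delta) \cap F is never larger than the full disc:
\[
  \frac{\pi}{4}\,\delta^{2} \;\le\; \mu\bigl(B(x,\delta)\bigr) \;\le\; \pi\,\delta^{2},
  \qquad\text{so one may take } \alpha=\tfrac{\pi}{4},\ \beta=\pi,\ \delta_0=\tfrac12 .
\]
```

The lower bound is attained at the corners of the square and the upper bound at interior points, so the two constants cannot be merged into one; this is the sense in which the space is only "semi" uniform.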

Examples of such semi uniform spaces are the Cantor dust and the Sierpinski
carpet, endowed with the metric of their embedding in $\mathbb{R}^n$ and the natural
probability measures [FSU]. We show that for such probability spaces the classical
asymptotic laws of maximal MST edge still hold with fractional powers, if the
space is connected. For example, in the case of Sierpinski's carpet, with dimension
$d = \log 8/\log 3$, the longest edge of the MST is $\approx \left(\frac{\log n}{n}\right)^{1/d}$ with probability $1 - \epsilon$
for all $\epsilon > 0$. Formally, the statement is

Theorem 1.2. Let $F$ be a compact connected subset of $\mathbb{R}^n$ and let $\mu$ be a
semi-uniform measure of dimension $d$ on $F$. Then there exist two constants $C > c > 0$
such that for all $\epsilon > 0$ and for any $m$ sufficiently large (depending on $\epsilon$), if
$X_1, \dots, X_m$ are independent samples from $\mu$ then

(1) If $r > C\left(\frac{\log m}{m}\right)^{1/d}$ then the set $\bigcup_{i=1}^m B(X_i,r)$ is connected with probability
at least $1-\epsilon$.
(2) If $r < c\left(\frac{\log m}{m}\right)^{1/d}$ then the set $\bigcup_{i=1}^m B(X_i,r)$ is connected with probability at most $\epsilon$.

Here and below we shall use the convention that $c$ and $C$ denote constants whose
value might change from formula to formula and even inside the same formula. $c$
and $C$ might depend on $d$, $\alpha$ and $\beta$. $c$ will usually denote constants which are
"small enough" and $C$ constants which are "large enough".

2. Proof 

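To prove the first part of the theorem choose $\delta := \left(\frac{2\log m}{\alpha m}\right)^{1/d}$ and let
$\{B(p_i,\delta)\}_{i=1}^{N(\delta)}$ be a maximal set of disjoint balls with centers $p_i \in F$. Then for each
$X_i$ and each $p_j$,

$\Pr[X_i \in B(p_j,\delta)] \ge \alpha\delta^d = \frac{2\log m}{m}$.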

Here and below $\Pr$ stands for probability with respect to the product measure
$\prod_{i=1}^m \mu$. Denote by $A_\delta$ the event that there is at least one of the $X_i$ in each of the balls $B(p_j,\delta)$.

Lemma 2.1. For large enough $m$, event $A_\delta$ occurs with probability at least $1 - \epsilon$.

Proof. Had all the measures of all the balls been equal, we could have immediately
used coupon collector. However, the measures of the balls are only equal up to a
constant factor. Still, for all $p_j$,

$\frac{2\log m}{m} \le \Pr[X_i \in B(p_j,\delta)]$.

Since the balls are disjoint, it follows that there are at most $m/(2\log m)$ of them.
Take $m/(2\log m)$ equiprobable coupons and denote by $A'$ the event of picking
every coupon at least once in $m$ trials. Then by coupon collector

$\Pr[A_\delta] \ge \Pr[A'] > 1 - \epsilon$. □
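The coupon collector estimate used here can be made explicit with a union bound. The following worked bound is a sketch under two assumptions that are not spelled out in the text: logarithms are natural, and the rounding of $m/(2\log m)$ to an integer is ignored. The symbol $N$ is introduced only for this illustration.

```latex
% N = m/(2 log m) equiprobable coupons, m independent trials.
% Uses the elementary inequality 1 - x <= e^{-x}.
\begin{align*}
\Pr[\text{some coupon is never picked}]
  &\le N\left(1-\frac{1}{N}\right)^{m}
   = \frac{m}{2\log m}\left(1-\frac{2\log m}{m}\right)^{m} \\
  &\le \frac{m}{2\log m}\, e^{-2\log m}
   = \frac{1}{2m\log m} \xrightarrow[m\to\infty]{} 0 .
\end{align*}
```

In particular the failure probability drops below any fixed $\epsilon > 0$ once $m$ is large enough, which is the form in which the estimate is used.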

Lemma 2.2. Conditioned on $A_\delta$, $\bigcup_{i=1}^m B(X_i,3\delta)$ is connected.

Proof. As is well known (and easy to see), the maximality of the family $B(p_i,\delta)$
implies that the family $B(p_i,2\delta)$ is a cover of $F$. Since we are given that the event
$A_\delta$ happens, it follows that the set $\bigcup B(X_i,3\delta)$ contains the set $\bigcup B(p_i,2\delta)$ and
therefore it also contains $F$. Now the lemma follows directly from the definition of
connectivity. Suppose, to the contrary, that $\bigcup B(X_i,3\delta)$ is separated, i.e. there
exist two nonempty open sets $U$, $V$ such that $U \cup V = \bigcup B(X_i,3\delta)$ and $U \cap V = \emptyset$. Using $U$, $V$ it
follows that $F$ is also separated, which is a contradiction. □
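The packing-to-cover step can be illustrated on a finite point set standing in for $F$. This is a minimal sketch under that assumption, not the construction used in the proof: centers are chosen greedily, and the helper names `greedy_packing` and `covered` are hypothetical.

```python
import math
import random


def greedy_packing(points, delta):
    """Greedily choose centers with pairwise distances >= 2*delta.

    The open balls B(p, delta) around the chosen centers are pairwise
    disjoint, and no further point of `points` can be added as a center,
    so the family is maximal within `points`.
    """
    centers = []
    for p in points:
        if all(math.dist(p, c) >= 2 * delta for c in centers):
            centers.append(p)
    return centers


def covered(points, centers, radius):
    """True if every point lies in some open ball B(c, radius)."""
    return all(any(math.dist(p, c) < radius for c in centers) for p in points)


# Usage: 300 seeded random points in the unit square as a stand-in for F.
rng = random.Random(1)
sample = [(rng.random(), rng.random()) for _ in range(300)]
packing = greedy_packing(sample, 0.1)
```

Every point that was rejected lies at distance less than $2\delta$ from an accepted center, so `covered(sample, packing, 0.2)` holds by construction; this is the finite analogue of the covering property quoted in the proof.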

2.1. With the first part of the theorem demonstrated, let us proceed to the second. 

This time choose $\delta = \left(\frac{\log m}{2\beta m}\right)^{1/d}$ and let $\{B(p_i,\delta)\}_{i=1}^{N(\delta)}$ be a maximal set of
disjoint balls with centers $p_i \in F$. Then for each $X_i$,

$\frac{\alpha}{\beta} \cdot \frac{\log m}{2m} \le \Pr[X_i \in B(p_j,\delta)] \le \frac{\log m}{2m}$.

Denote by $B_\delta$ the event that there are at least $\log m$ balls in $\{B(p_j,\delta)\}$ that contain
exactly a single point $X_i$. Denote by $Y_j$ the indicator of the event that the ball
$B(p_j,\delta)$ contains exactly one point $X_i$ and let $Y = \sum_j Y_j$ be the random variable
that counts the number of such balls in $\{B(p_j,\delta)\}$. It is not difficult to see that

$\mathbf{E}(Y_j) \ge \frac{\alpha}{\beta} \cdot \frac{\log m}{2} \left(1 - \frac{\log m}{2m}\right)^{m-1} = \frac{\log m}{\sqrt{m}} \left(\frac{\alpha}{2\beta} + o(1)\right)$
so for $m$ sufficiently large,

E(r) > c^/TOlogm . (2) 
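The asymptotics of the single-ball expectation can be checked numerically. The sketch below assumes a ball whose measure $q$ equals the upper bound $\log m/(2m)$ and natural logarithms; the function names are hypothetical. It compares $m q (1-q)^{m-1}$, the probability that exactly one of $m$ samples falls in the ball, with $\log m/(2\sqrt{m})$.

```python
import math


def exactly_one_probability(m, q):
    """Probability that exactly one of m independent samples hits a set of measure q.

    Computed as m*q*(1-q)**(m-1), with log1p for numerical stability.
    """
    return m * q * math.exp((m - 1) * math.log1p(-q))


def asymptotic_ratio(m):
    """Ratio of the exact value at q = log(m)/(2m) to log(m)/(2*sqrt(m))."""
    q = math.log(m) / (2 * m)
    return exactly_one_probability(m, q) / (math.log(m) / (2 * math.sqrt(m)))


# Usage: the ratio should approach 1 as m grows.
ratios = [asymptotic_ratio(10 ** k) for k in range(3, 8)]
```

The ratio differs from 1 by a term of order $\log^2 m/m$, consistent with the error terms that appear in the variance computation.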

Lemma 2.3. For $m$ sufficiently large,

Var(r) < C^ . 

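Proof. For any $1 \le i \ne j \le N(\delta)$ examine the variables $Y_i$ and $Y_j$. Denote $q_i =
\mu(B(p_i,\delta))$. Then, because $q_i \le C\log m/m$ we have

$\mathbf{E}(Y_i) = m q_i (1 - q_i)^{m-1} = m q_i \exp\left((-q_i + O(q_i^2))(m-1)\right) =$

$= m q_i \exp\left(-q_i m + O\left(\frac{\log^2 m}{m}\right)\right) =$

$= m q_i \exp(-m q_i) \left(1 + O\left(\frac{\log^2 m}{m}\right)\right)$.

A similar calculation gives

$\mathbf{E}(Y_i Y_j) = m(m-1) q_i q_j (1 - q_i - q_j)^{m-2} =$

$= m^2 q_i q_j \exp(-m q_i - m q_j) \left(1 + O\left(\frac{\log^2 m}{m}\right)\right)$.

Combining both we get, for $m$ sufficiently large,

$\mathrm{Var}(Y) = \mathbf{E}(Y^2) - \mathbf{E}(Y)^2 = \sum_{i=1}^{N(\delta)} \sum_{j=1}^{N(\delta)} \left(\mathbf{E}(Y_i Y_j) - \mathbf{E}(Y_i)\mathbf{E}(Y_j)\right) \le$

$\le \sum_{i} \mathbf{E}(Y_i) + \sum_{i \ne j} m^2 q_i q_j \exp(-m q_i - m q_j) \cdot C\frac{\log^2 m}{m} \le$

$\le \mathbf{E}(Y) + C\frac{\log^2 m}{m} \mathbf{E}(Y)^2$.

This ends the lemma by virtue of (2). □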

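Lemma 2.4. For large enough $m$, event $B_\delta$ occurs with probability at least $1 - \epsilon$.
Proof. By Lemma 2.3 and the Chebyshev inequality it follows that

$\Pr\left[\,|Y - \mathbf{E}(Y)| > \tfrac{1}{2}\mathbf{E}(Y)\right] \le \frac{4\,\mathrm{Var}(Y)}{\mathbf{E}(Y)^2}$,

which is smaller than $\epsilon$ for $m$ sufficiently large. On the complementary event, $Y \ge \tfrac{1}{2}\mathbf{E}(Y) \ge \log m$ by (2). □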






Lemma 2.5. Conditioned on $B_\delta$, $\bigcup_i B(X_i, \tfrac{1}{4}\delta)$ is disconnected with
probability at least $1 - \epsilon$.

Proof. Given that $B(p_j,\delta)$ is a ball containing just a single point $X_i$, it follows
that there is a constant probability that this point will be in $B(p_j, \tfrac{1}{2}\delta)$, which would
mean that $\bigcup_i B(X_i, \tfrac{1}{4}\delta)$ is disconnected. Since we have $\log m$ (Bernoulli) trials, the
probability for at least one to succeed is at least $1 - \epsilon$. This proves the lemma and the
theorem. □
3. Future Work

We believe it would be interesting to characterize the fractal sets that naturally 
induce a semi uniform probability space. Also, in terms of asymptotic analysis, it 
would be interesting to investigate other geometric properties of these spaces, such 
as minimal matching, TSP and power sum of MST edges. 